SELF TRAINING AND ENSEMBLING FREQUENCY DEPENDENT NETWORKS WITH COARSE PREDICTION POOLING AND SOUND EVENT BOUNDING BOXES
https://dcase.community/documents/challenge2024/technical_reports/DCASE2024_Nam_38_t4.pdf
周波数の特性に着目 > FreDNets
ブランチが三つ > ATST,BEATs,CNN
2節にモデルの説明があるっぽい
長いので後回し
https://gyazo.com/1fd7c6f43249836fd75fda512ad5f6cf